
Search in the Catalogues and Directories

Hits 81 – 100 of 411

81
Result Diversity and Entity Ranking Experiments: Anchors, Links, Text and Wikipedia
In: DTIC (2009)
Abstract: In this paper, we document our efforts in participating in the TREC 2009 Entity Ranking and Web Tracks. We had multiple aims. For the Web Track's Adhoc task, we experiment with document text and anchor text representations and with the use of the link structure. For the Web Track's Diversity task, we experiment with a top-down sliding window that, given the top-ranked documents, chooses as the next ranked document the one that has the most unique terms or links. We test our sliding-window method on a standard document text index and on an index of propagated anchor texts. We also experiment with extreme query expansions by taking the top n results of the initial ranking as multi-faceted aspects of the topic, constructing n relevance models to obtain n sets of results. A final diverse set of results is obtained by merging the n result lists. For the Entity Ranking Track, we also explore the effectiveness of the anchor text representation, look at the co-citation graph, and experiment with using Wikipedia as a pivot. Our main findings can be summarized as follows. Anchor text is very effective for diversity: it gives high early precision, and the results cover more relevant sub-topics than the document text index. Our baseline runs have low diversity, which limits the possible impact of the sliding-window approach. New link information seems more effective for diversifying text-based search results than the number of unique terms added by a document. In the entity ranking task, anchor text finds few primary pages, but it does retrieve a large number of relevant pages. Using Wikipedia as a pivot results in large gains in P10 and NDCG when only primary pages are considered. Although the links between the Wikipedia entities and pages in the Clueweb collection are sparse, the precision of the existing links is very high.
Presented at the Text REtrieval Conference (TREC 2009, 18th) held in Gaithersburg, Maryland on 17-20 November 2009. Published in the Proceedings of the Text REtrieval Conference (TREC 2009, 18th), 2009. The conference was co-sponsored by the National Institute of Standards and Technology (NIST), the Defense Advanced Research Projects Agency (DARPA), and the Advanced Research and Development Activity (ARDA).
Keyword: *CLUEWEB; *INFORMATION RETRIEVAL; *INTERNET; *NDCG(NORMALIZED DISCOUNTED CUMULATIVE GAIN); *SEMANTICS; *TEXT PROCESSING; *WEB TRACK COMPUTER SYSTEM; ANCHOR TEXT; BASE LINES; CATEGORY MAPPING; Computer Programming and Software; Equipment and Methods; FOREIGN REPORTS; INFORMATION PROCESSING; Information Science; INTERNET BROWSERS; Linguistics; LINK FILTERS; NETHERLANDS; PIVOTS; REPRINTS; SLIDING; SLIDING WINDOWS; SOCIAL COMMUNICATION; SYMPOSIA; TERM FILTERS; Test Facilities; TEXTBOOKS; WIKIPEDIA; WINDOWS
URL: http://www.dtic.mil/docs/citations/ADA517853
http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA517853
BASE
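The sliding-window diversification described in the abstract of entry 81 can be illustrated with a minimal sketch. The abstract does not specify the window size, the tie-breaking rule, or how documents are represented, so everything below is an assumption: the function name `sliding_window_rerank`, the `window` parameter, and the representation of each document as a set of features (terms or outgoing links) are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Set


def sliding_window_rerank(
    ranking: List[str],
    features: Dict[str, Set[str]],
    window: int = 3,
) -> List[str]:
    """Greedy re-ranking under assumed details (not the authors' exact method).

    At each step, look at the next `window` unselected documents in the
    original order and pick the one that adds the most features (terms or
    links) not yet covered by the documents already selected.
    """
    remaining = list(ranking)
    seen: Set[str] = set()
    reranked: List[str] = []
    while remaining:
        candidates = remaining[:window]
        # max() returns the first maximal item, so ties keep the original order.
        best = max(candidates, key=lambda d: len(features.get(d, set()) - seen))
        reranked.append(best)
        seen |= features.get(best, set())
        remaining.remove(best)
    return reranked


# Toy data: d2 repeats d1's features, d3 brings new ones.
toy_ranking = ["d1", "d2", "d3", "d4"]
toy_features = {
    "d1": {"a", "b"},
    "d2": {"a", "b"},
    "d3": {"c", "d", "e"},
    "d4": {"a"},
}
print(sliding_window_rerank(toy_ranking, toy_features, window=2))
```

With a window of 1 the original ranking is returned unchanged, which matches the abstract's observation that a baseline with low diversity limits what the window can achieve: the method only reorders documents that are already near the top.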
82
PRIS at 2009 Relevance Feedback track: Experiments in Language Model for Relevance Feedback
In: DTIC (2009)
BASE
83
A Study of Faceted Blog Distillation -- PRIS at TREC 2009 Blog Track
In: DTIC (2009)
BASE
84
The Synthetic Teammate Project
In: DTIC (2009)
BASE
85
Formulating Simple Structured Queries using Temporal and Distributional Cues in Patents
In: DTIC (2009)
BASE
86
Experiments on Related Entity Finding Track at TREC 2009
In: DTIC (2009)
BASE
87
THUIR at TREC 2009 Web Track: Finding Relevant and Diverse Results for Large Scale Web Search
In: DTIC (2009)
BASE
88
PARADISE Based Search Engine at TREC 2009 Web Track
In: DTIC (2009)
BASE
89
Microsoft Research at TREC 2009. Web and Relevance Feedback Tracks
In: DTIC (2009)
BASE
90
FEUP at TREC 2009 Blog Track: Temporal Evidence in the Faceted Blog Distillation Task
In: DTIC (2009)
BASE
91
A Comparison of Query-by-Example Methods for Spoken Term Detection
In: DTIC (2009)
BASE
92
The Multi-Session Audio Research Project (MARP) Corpus: Goals, Design and Initial Findings
In: DTIC (2009)
BASE
93
Long Term Examination of Intra-Session and Inter-Session Speaker Variability
In: DTIC (2009)
BASE
94
UDEL/SMU at TREC 2009 Entity Track
In: DTIC (2009)
BASE
95
Perturbation and Pitch Normalization as Enhancements to Speaker Recognition
In: DTIC (2009)
BASE
96
Long Term Examination of Intra-Session and Inter-Session Speaker Variability
In: DTIC (2009)
BASE
97
Finding Related Entities by Retrieving Relations: UIUC at TREC 2009 Entity Track
In: DTIC (2009)
BASE
98
Facet Classification of Blogs: Know-Center at the TREC 2009 Blog Distillation Task
In: DTIC (2009)
BASE
99
Patent Retrieval in Chemistry based on Semantically Tagged Named Entities
In: DTIC (2009)
BASE
100
A Hybrid Method for Opinion Finding Task (KUNLP at TREC 2008 Blog Track)
In: DTIC (2008)
BASE
